Speaker adaptation for non-native speakers using bilingual English lexicon and acoustic models
Abstract
This paper proposes a supervised speaker adaptation method that is effective for English spoken by both non-native (i.e., Japanese) and native speakers. The method uses English and Japanese phoneme acoustic models together with a pronunciation lexicon in which each word has both English and Japanese phoneme transcriptions. The same utterances are used to adapt both acoustic models. The recognition system uses the two adapted acoustic models and the lexicon, and outputs the highest-likelihood word sequence found over combinations of English-pronounced and Japanese-pronounced words. Continuous speech recognition experiments show that the proposed adaptation method greatly improves recognition performance for both Japanese-accented and native English. The system using the bilingual adapted models achieves higher accuracy for Japanese speakers than the systems using monolingual models, while matching, for native speakers, the performance of an English recognition system that uses an English adapted model.
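The selection between English-pronounced and Japanese-pronounced variants of a word can be illustrated with a minimal sketch. This is not the paper's implementation: the lexicon entries, the phoneme symbols, the per-phoneme log-likelihood tables, and the names `LEXICON`, `variant_score`, and `best_variant` are all assumptions made for illustration. A real system scores whole word sequences frame by frame with the adapted acoustic models.

```python
from typing import Dict, List, Tuple

# Hypothetical bilingual lexicon: each word has an English-phoneme ("en") and a
# Japanese-phoneme ("ja") transcription. Phoneme symbols are illustrative only.
LEXICON: Dict[str, Dict[str, List[str]]] = {
    "right": {"en": ["r", "ay", "t"], "ja": ["r", "a", "i", "t", "o"]},
    "light": {"en": ["l", "ay", "t"], "ja": ["r", "a", "i", "t", "o"]},
}

FLOOR = -50.0  # assumed penalty for a phoneme the model has no score for


def variant_score(phones: List[str], phone_loglik: Dict[str, float]) -> float:
    """Toy stand-in for an acoustic score: sum of per-phoneme log-likelihoods."""
    return sum(phone_loglik.get(p, FLOOR) for p in phones)


def best_variant(
    word: str, loglik_by_model: Dict[str, Dict[str, float]]
) -> Tuple[str, float]:
    """Score both transcriptions of `word`, each with its own language's
    (adapted) model, and return the (language, score) pair that scores higher."""
    scored = [
        (lang, variant_score(LEXICON[word][lang], loglik_by_model[lang]))
        for lang in ("en", "ja")
    ]
    return max(scored, key=lambda pair: pair[1])
```

Calling `best_variant("right", scores)`, where `scores` maps `"en"` and `"ja"` to per-phoneme log-likelihood tables, returns the language whose transcription scores higher together with that score. Summing per-phoneme values favours shorter transcriptions, so this toy score only shows the selection step, not a usable acoustic likelihood.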
Similar resources
Non-native English speech recognition using bilingual English lexicon and acoustic models
This paper proposes an English speech recognition system which can recognize both non-native (i.e. Japanese) and native English speakers’ pronunciation of English speech. The system uses a bilingual pronunciation lexicon in which each word has both English and Japanese phoneme transcriptions. The Japanese transcription is constructed considering typical Japanese pronunciation of English. Japane...
Speaker adaptation method for CALL system using bilingual speakers' utterances
Several CALL systems use two acoustic models to evaluate a learner’s pronunciation. To achieve high evaluation performance, speaker adaptation is introduced into the CALL system. Adaptation requires data in the target language; however, a learner cannot pronounce the target language correctly. In this paper, we propose two new speaker adaptation methods for CALL systems. The new methods on...
Comparison of acoustic model adaptation techniques on non-native speech
The performance of speech recognition systems is consistently poor on non-native speech. The challenge for non-native speech recognition is to maximize recognition performance with the small amount of non-native data available. In this paper we report on acoustic model adaptation for the recognition of non-native speech. Using non-native data from German speakers, we investigate how bili...
Speech recognition for multiple non-native accent groups with speaker-group-dependent acoustic models
In this paper, the recognition performance for non-native English speech with two different kinds of speaker-group-dependent acoustic models is investigated. The approaches for creating speaker groups include knowledge-based grouping of non-native speakers by their first language, and the automatic clustering of speakers. Clustering is based on speaker-dependent acoustic models in speaker Eigensp...
Speaker Adaptation Method for CALL Systems Using Bilingual Speakers’ Utterances
Several CALL systems use two acoustic models to evaluate a learner’s pronunciation. To achieve high evaluation performance, speaker adaptation is introduced into the CALL system. Adaptation requires data in the target language; however, a learner cannot pronounce the target language correctly. In this paper, we propose two new speaker adaptation methods for CALL systems. The new methods on...